Data structures via the cli supporting docs #1017

Draft
wants to merge 11 commits into base: main

Conversation

jamessnowplow

work in progress

netlify bot commented Sep 26, 2024

Deploy Preview for snowplow-docs ready!

Name Link
🔨 Latest commit 1c346b5
🔍 Latest deploy log https://app.netlify.com/sites/snowplow-docs/deploys/670f9d1cf4df890008a392b0
😎 Deploy Preview https://deploy-preview-1017--snowplow-docs.netlify.app

Contributor

@gleb-lobov left a comment

Looks good overall, but I don't think we communicate anywhere that you need to edit the files in place instead of creating new files for each version. It's kinda implied by the names not having versions, but I think we still want to mention it somewhere. Maybe the recipe will cover it.

sidebar_position: 2
sidebar_custom_props:
  offerings:
    - bdp
Contributor

Should cloud be included as well?

Author

As far as I can tell, this is just how the docs differentiate between BDP and Community. There are no other options.

Contributor

Maybe I'm mixing something up, or the docs have changed since the last time I did anything there.


1. Is each file a valid format (yaml/json) with expected fields
2. Does the schema in the file conform to [snowplow expectations](/docs/understanding-your-pipeline/schemas/#the-anatomy-of-a-schema)
3. Given the organization's [loading configuration](http://localhost:3000/docs/storing-querying/loading-process/) will any schema version number choices have a potentially negative effect on data loading
Contributor

The link in item 3 probably needs a host update: it points at localhost.
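
For context on check 2 in the quoted list, here is a minimal Python sketch of what "conforms to Snowplow expectations" could look like for a single JSON schema. It only checks the `self` block (`vendor`, `name`, `format`, `version`) and the SchemaVer shape of the version. The function name `check_schema` and the error messages are made up for this sketch; it is an illustration, not how snowplow-cli implements validation.

```python
import json
import re

# SchemaVer is MODEL-REVISION-ADDITION, e.g. "1-0-0"; the model starts at 1.
SCHEMAVER = re.compile(r"^[1-9][0-9]*-(0|[1-9][0-9]*)-(0|[1-9][0-9]*)$")
REQUIRED_SELF_KEYS = ("vendor", "name", "format", "version")


def check_schema(text):
    """Return a list of problems found in one JSON schema document.

    Hypothetical helper for illustration only; an empty list means
    the checks in this sketch passed.
    """
    try:
        schema = json.loads(text)
    except json.JSONDecodeError as exc:
        return [f"not valid JSON: {exc.msg}"]
    if not isinstance(schema, dict):
        return ["top level must be an object"]

    self_block = schema.get("self")
    if not isinstance(self_block, dict):
        return ["missing 'self' object"]

    # Report every missing key, in a stable order.
    problems = [
        f"self.{key} is missing"
        for key in REQUIRED_SELF_KEYS
        if key not in self_block
    ]

    # Only check the version shape when a version is present.
    version = self_block.get("version")
    if version is not None and not SCHEMAVER.match(str(version)):
        problems.append(f"self.version {version!r} is not SchemaVer")
    return problems
```

Calling `check_schema` on the contents of each file and failing the run when any list is non-empty would cover checks 1 and 2 for JSON files; check 3 depends on the organization's loading configuration and is not attempted here.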


* A deployed Snowplow BDP pipeline
* [snowplow-cli](/docs/understanding-tracking-design/managing-your-data-structures/cli/#download) downloaded and configured
* A familiarity with [git](https://git-scm.com/) and an understanding of [github workflows](https://docs.github.com/en/actions/writing-workflows)
Contributor

I think the product is called GitHub Actions, but I'm not sure what the difference is.

Author

I thought that too, but their docs are a bit confusing. They refer to workflows in a lot of places, then to actions in their more detailed stuff.

Contributor

I don't know; we'd need some marketing people for that. I would use Actions, since it's the more recognisable name.

Author

As far as I can tell, Actions is the platform and workflows are the things you create. Yeah, I can change it to Actions to avoid confusing anyone who hasn't read the definitions.


- run: curl -L -o snowplow-cli https://github.com/snowplow-product/snowplow-cli/releases/latest/download/snowplow-cli_linux_x86_64 && chmod u+x snowplow-cli

- run: ./snowplow-cli ds publish dev --managed-from $GITHUB_REPOSITORY
Contributor

Should we move the --managed-from to be an env variable as well?

Author

Maybe? I don't feel strongly. It does kind of fit with things like org id. Yeah, I'd be OK with it either way.

Contributor

I think it makes sense, it makes the command a bit cleaner
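
To make the option discussed in this thread concrete, here is a minimal Python sketch of a wrapper that builds the publish command and takes the `--managed-from` value from the environment. `MANAGED_FROM` is a hypothetical variable name chosen for this sketch, not something snowplow-cli is known to read; `GITHUB_REPOSITORY` is the variable GitHub Actions sets by default, and the command and flag are the ones quoted from the workflow.

```python
def publish_command(env):
    """Build the argv for the publish step from a mapping of env vars.

    MANAGED_FROM is a hypothetical override for this sketch; when it is
    unset or empty, the repository name provided by GitHub Actions is used.
    """
    repo = env.get("MANAGED_FROM") or env.get("GITHUB_REPOSITORY")
    cmd = ["./snowplow-cli", "ds", "publish", "dev"]
    if repo:
        # Pass the value through the existing flag from the quoted workflow.
        cmd += ["--managed-from", repo]
    return cmd
```

Passing `os.environ` as `env` and handing the result to `subprocess.run` would run the step; if the CLI read such a variable itself, the workflow line could drop the flag, which is the cleaner command the thread is asking about.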

@cksnp left a comment

I have a few small suggestions (and I think a typo fix or two).
Before we merge to prod I need to coordinate with marketing, support, etc. So let's have it ready without merging.

docs/recipes/recipe-data-structures-in-git/index.md: 8 outdated, resolved review threads
@cksnp left a comment

LG! As said, let's wait for 👍 by marketing etc before merging

@johnmicahreid
Contributor

@jamessnowplow can we add an explicit example where you request someone else's approval on a new data structure, e.g. from a data team member? I know it's implied by managing stuff via GitHub, but it would be really good to call out an approval workflow specifically (e.g. where Contributor A does not have merge approval and Reviewer B does).

@jamessnowplow
Author

@johnmicahreid I'm a bit worried about accidentally writing more about GitHub setup and workflows than about snowplow-cli.

What level of knowledge are we assuming here? Would the example include how to set up approval workflows, branch rules and PRs in GitHub? Or try to capture the essence in a series of screenshots or something?

6 participants